NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Zeroth-Order Optimization Finds Flat Minima

Zhang, Liang; Li, Bingcong; Thekumparampil, Kiran_Koshy; Oh, Sewoong; Muehlebach, Michael; He, Niao (June 2025, https://doi.org/10.48550/arXiv.2506.05454)

Zeroth-order methods are extensively used in machine learning applications where gradients are infeasible or expensive to compute, such as black-box attacks, reinforcement learning, and language model fine-tuning. Existing optimization theory focuses on convergence to an arbitrary stationary point, but less is known about the implicit regularization that provides a fine-grained characterization of which particular solutions are reached. This paper shows that zeroth-order optimization with the standard two-point estimator favors solutions with small trace of Hessian, a measure widely used to distinguish between sharp and flat minima. The authors provide convergence rates of zeroth-order optimization to approximate flat minima for convex and sufficiently smooth functions, defining flat minima as minimizers that achieve the smallest trace of Hessian among all optimal solutions. Experiments on binary classification tasks with convex losses and language model fine-tuning support the theoretical findings.
more » « less
Full Text Available
DPzero: Dimension-independent and differentially private zeroth-order optimization

Zhang, Liang; Thekumparampil, Kiran Koshy; Oh, Sewoong; He, Niao (July 2024, International Conference on Machine Learning (ICML 2024))

Full Text Available
A Discrete-Time Switching System Analysis of Q-Learning

https://doi.org/10.1137/22M1489976

Lee, Donghwan; Hu, Jianghai; He, Niao (June 2023, SIAM Journal on Control and Optimization)

Full Text Available
Sample Complexity and Overparameterization Bounds for Temporal-Difference Learning With Neural Network Approximation

https://doi.org/10.1109/TAC.2023.3234234

Cayci, Semih; Satpathi, Siddhartha; He, Niao; Srikant, R. (May 2023, IEEE Transactions on Automatic Control)

Full Text Available
Global Convergence and Variance Reduction for a Class of Nonconvex- Nonconcave Minimax Problems

Yang, Junchi; Kiyavash, Negar; He, Niao (October 2020, Advances in neural information processing systems)
null (Ed.)
Full Text Available
A Catalyst Framework for Minimax Optimization

Yang, Junchi; Zhang, Siqi; Kiyavash, Negar; He, Niao (October 2020, Advances in neural information processing systems)
null (Ed.)
Full Text Available
The Devil is in the Detail: a Framework for Macroscopic Prediction via Microscopic Models

Yang, Yingxiang; Kiyavash, Negar; Song, Le; He, Niao (October 2020, Advances in neural information processing systems)
null (Ed.)
Full Text Available
Optimization for Reinforcement Learning: From a single agent to cooperative agents

https://doi.org/10.1109/MSP.2020.2976000

Lee, Donghwan; He, Niao; Kamalaruban, Parameswaran; Cevher, Volkan (May 2020, IEEE Signal Processing Magazine)

Full Text Available
Sample Complexity of Sample Average Approximation for Conditional Stochastic Optimization

https://doi.org/10.1137/19M1284865

Hu, Yifan; Chen, Xin; He, Niao (January 2020, SIAM Journal on Optimization)
null (Ed.)
Full Text Available
Point Process Estimation with Mirror Prox Algorithms

https://doi.org/10.1007/s00245-019-09634-6

He, Niao; Harchaoui, Zaid; Wang, Yichen; Song, Le (November 2019, Applied Mathematics & Optimization)

Full Text Available

Search for: All records